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(54) Audio information provision system 

(57) An audio information provision system for pro- 
viding a target with an audio information stream suitable 
to the target includes a database for storing a plurality 
of audio infomriation streams; an inherent condition input 
section for receiving an inherent condition of the target; 
a variable condition input section for receiving a variable 
condition varying in accordance with time; a selection 



section for selecting at least one audio information 
stream from the plurality of audio information streams 
based on at least the inherent condition and the variable 
condition; and an output section for outputting the at 
least one audio information stream. 
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Description 

FIELD OF THE INVENTION 

5 1 . FIELD OF THE INVENTION: 

[0001] The present Invention relates to an audio information provision system for selecting and providing background 
music which is suitable to the objective and image of various spaces, for example, commercial spaces such as depart- 
ment stores and other types of stores, public spaces such as hotels and offices, or private spaces where people can 
10 enjoy themselves such as automobiles and houses. 

2. DESCRIPTION OF THE RELATED ART: 

[0002] Conventionally, background music has been provided to commercial spaces and public spaces mainly as 
15 follows. Music selecting specialists select background music which is suitable to the image of each commercial or 
public space, and the selected background music is provided in the form of recording media such as CDs or the like. 
Alternatively, background music channels of cable broadcasting including a wide variety of music programs are sub- 
scribed to. 

[0003] The system of having specialists select suitable background music uses expert knowledge and results in a 
20 higii ievel of satisfaction of the users. However, this system is disadvantageously costly and thus can be accepted only 

by clients who highly appreciate the sales promoting effect of background music. In addition, the selected background 

music is not always suitable to the image and objective of the space or the type of customers or people within the space. 

[0004] The use of recording media such as CDs requires the user to play the same background music recorded on 

the recording media for a certain period of time. 
25 [0005] The subscription to cable broadcasting allows the user only to select a music genre, and the user is also 

required to play the predetermined programs of the selected genre for a certain period of time. 

[0006] As can be appreciated from the above, it has conventionally been difficult to provide a user with background 
music suitable to the objective or image of each individual commercial or public space or taste of people present in 
the space in accordance with changes in time, environment and other conditions. 

30 

SUMMARY OF THE INVENTION 

[0007] An audio information provision system for providing a target with an audio information stream suitable to the 
target including a database for storing a plurality of audio infomnation streams; an inherent condition input section for 
35 receiving an inherent condition of the target; a variable condition input section for receiving a variable condition varying 
in accordance with time; a selection section for selecting at least one audio information stream from the plurality of 
audio information streams based on at least the inherent condition and the variable condition; and an output section 
for outputting the at least one audio infonnation stream. 

[0008] In one embodiment of the invention, the database stores a plurality of related information streams respectively 
40 -related to the plurality of audio information streams. Each of the related information streams has a coordinate value 
representing a position of the corresponding audio information stream in a prescribed coordinate system which defines 
an impression of audio information streams. The selection section determines a coordinate value representing a po- 
sition of the target in the prescribed coordinate system based on the inherent condition and the variable condition, and 
selects at least one audio infomriation stream corresponding to at least one related information stream having a coor- 
ds dinate value which is located within a prescribed range from the coordinate value representing the position of the target. 
[0009] In one embodiment of the invention, at least one related information stream among the plurality of related 
information streams includes adjustment information which indicates that a distance between a coordinate value in- 
cluded in the at least one related infonmation stream and the coordinate value of the target is adjusted, based on at 
least one of the inherent condition and the variable condition. The selection section changes the coordinate value 
50 included in the at least one related information stream. 

[0010] In one embodiment of the invention, the audio information provision system further comprises a reserved 
condition input section for receiving a reserved condition indicating that a preset audio information stream is output by 
the output section at a preset time. The output section outputs the preset audio information stream at the preset time. 
[0011] In one embodiment of the invention, the audio information provision system further includes an economic 
55 condition input section for receiving an economic condition representing a desired cost for the at least one audio in- 
formation stream. The selection section selects at least one audio information stream, based on the economic condition, 
from among the at least one audio information stream selected from the plurality of audio information streams based 
on the inherent condition and the variable condition. 
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[0012] In one embodiment of the invention, the plurality of related information streams further include a plurality of 
physical feature information streams each representing a physical feature of the corresponding audio Information 
stream of the plurality of audio infomnation streams and a plurality of bibliographical information streams each repre- 
senting a bibliography of the corresponding audio information stream of the plurality of audio information streams. 

5 [0013] In one embodiment of the invention, the selection section is connected to the inherent condition Input section, 
the variable condition Input section and the output section via a communication line. 
[0014] In one embodiment of the invention, the target is one of a commercial space and a public space. 
[0015] In one embodiment of the invention, the target is an individual. The inherent condition represents inherent 
infonnation of the individual. The variable condition represents mood information of the individual. 

10 [0016] In one embodiment of the invention, the audio information provision system further includes an economic 
condition input section for receiving an economic condition representing a desired cost for the at least one audio in- 
formation stream; a mood information analysis section for analyzing the mood information and outputting a mood 
infonnation analysis result; and an individual information accumulation section for accumulating the inherent condition, 
the mood information analysis result and the economic condition. The selection section selects at least one audio 

15 information stream, based on the economic condition, from among the at least one audio Information stream selected 
from the plurality of audio information streams based on the inherent condition and the mood information analysis result. 
[0017] In one embodiment of the Invention, the mood Information analysis result and the economic condition are 
accumulated in the individual information accumulation section as individual preference information representing an 
individual preference. The individual preference information is updated each time the mood information analysis result 
20 and the economic condition are input to the individual information accumulation section. 

[0018] In one embodiment of the Invention, the audio information provision system further includes a satisfaction 
degree information input section for receiving satisfaction degree infonnation representing a satisfaction degree of the 
individual for the at least one audio information stream. 

[0019] In one embodiment of the invention: the individual information accumulation section accumulates a past se- 

25 lection result provided by the selection section. The audio infomnation provision system further includes a feedback 
section for presenting to variable condition Input section, as individual preference Information representing an individual 
preference, the past selection result accumulated in the Individual infonnation accumulation section. The variable con- 
dition input section provides the individual with an input interface based on the individual preference information. 
[0020] In one embodiment of the invention, the audio information provision system further includes an economic 

50 condition input section for receiving an economic condition representing a desired cost for the at least one audio in- 
formation stream; a mood information analysis section for analyzing the mood information and outputting a mood 
infonnation analysis result; and an individual Information accumulation section for accumulating the inherent condition, 
the mood infonnation analysis result and the economic condition. The selection section selects at least one audio 
infonnation stream from the plurality of audio information streams based on instruction information from a musicother- 

35 apist based on the inherent condition, the mood information analysis result and the economic condition. 

[0021] In one embodiment of the invention, the variable condition input section inputs impression information repre- 
senting an impression of an audio infonnation stream desired by the individual as the mood information. 
[0022] In one embodiment of the invention, the variable condition input section includes a display section. The var- 
iable condition input section provides the individual with a prescribed coordinate system which defines an impression 

^o of audio information streams through the display section. The impression information is input to the variable condition 
Input section by the individual specifying at least one point in the prescribed coordinate system. 
[0023] In one embodiment of the invention, the prescribed coordinate system includes a plurality of words represent- 
ing the impression. The plurality of words are changed in accordance with the type of audio infonnation stream desired 
by the individual. 

45 [0024] In one embodiment of the invention, the prescribed coordinate system has a plurality of image parts. 

[0025] In one embodiment of the invention, the impression Is represented by at least one of a word, a color and a 
symbol. 

[0026] Thus, the invention described herein makes possible the advantages of providing a system for selecting back- 
ground music suitable to the objective or image of commercial spaces such as department stores and other types of 
50 stores, public spaces such as hotels and offices, or private spaces where people can enjoy themselves such as auto- 
mobiles and houses. 

[0027] These and other advantages of the present invention will become apparent to those skilled in the art upon 
reading and understanding the following detailed description with reference to the accompanying figures. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[0028] 

Figure 1 shows an audio information provision system according to a first example of the present invention; 

Figure 2 shows an information delivery system using the audio information provision system according to the first 
example and a communication line; 

Figure 3 shows a structure of a selection section of the audio information provision system according to the first 
example; 

Figure 4 shows an audio information provision system according to a second example of the present invention; 

Figures 5A, 5B and 5C each show an input Interface used for inputting an inherent condition according to the first 
example; 

Figure 6 shows an Input interface used for Inpuning a variable condition according to the first example; 
Figure 7 shows an input interface used for inputting a reserved condition according to the first example; 
Figure 8 shows a structure of information stored in a database; 

Figure 9 shows a process for creating an impression space according to the first example; 

Figure 10 shows a process for obtaining a mapping function according to the first example; 

Figure 11 shows a process for performing mapping to the Impression space according to the first example; 

Figure 12 shows a component score coefficient for each space ID according to the first example; 

Figure 13 shows an impression space to which space IDs are mapped according to the first example; 

Figure 14 shows an impression space coordinate value for each space ID according to the first example; 

Figure 15 shows classification by time-and-day-of-the-week ID according to the first example; 

Figure 16 shows an Impression space to which time-and-day-of-the-week IDs are mapped according to the first 
example; 

Figure 17 shows classification by season ID according to the first example; 

Figure 18 shows an impression space to which season IDs are mapped according to the first example; 

Figure 19 shows classification by weather condition ID according to the first example; 

Figure 20 shows classification by weather condition ID according to the first example; 

Figure 21 shows adjustment information of each musical genre by space ID according to the first example; 

Figure 22 shows adjustment infomnation for related bibliographical information of each special space ID according 
to the first example; 

Figure 23 shows adjustment infomiation for related bibliographical infomnation of each special season ID according 

55 to the first example; 

Figure 24 shows a structure of a selection section of the audio Information provision system according to the 
second example; 
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Figure 25 shows an audio information provision system according to a third example of the present invention; 

Figure 26 shows an input interface used by a user for inputting inherent information, mood Information and eco- 
nomic condition to a terminal; 

5 

Figure 27 shows mood information input by the user and the analysis result provided by a mood information analysis 
section; 

Figure 28 shows another audio information provision section according to the third example of the present invention ; 

10 

Figure 29 shows an input interface used for inputting, to a satisfaction degree input section, whether the provided 
audio information was sampled, whether the user was satisfied with the provided audio information stream, and 
the image the user has on the audio information stream that he/she was not satisfied with; 

IS Figure 30 shows an Input Interface used for Inputting mood information to a variable condition Input section; 

Figure 31 shows still another audio information provision system according to the third example of the present 
Invention; 

20 Figure 32 shows an audio Information provision system according to a fourth example of the present Invention; 

Figure 33 shows an input interface of a variable condition input section and an economic condition input section 
used when the user demands musicotherapy; 

25 Figure 34 shows a coordinate system representing a music Image chart according to a fifth example of the present 

invention; 

Figure 35 shows classification of representative adjectives by music image according to the fifth example; 

30 Figure 36 shows the relationship between the musical features and the music image parts; 

Figure 37 shows another coordinate system representing a music image chart according to the fifth example; 

Figure 38 shows still another coordinate system representing a music image chart according to the fifth example; 
35 and 

Figure 39 shows still another coordinate system representing a music image chart according to the fifth example, 
DESCRIPTION OF THE EMBODIMENTS 

40 

[0029] Hereinafter, the present Invention will be described by way of Illustrative examples with reference to the ac- 
companying drawings. 

(Example 1) 

45 

[0030] Figure 1 shows an audio information provision system 1 00 according to a first example of the present Invention. 
In the first example, the audio information provision system 100 provides a commercial or public space with audio 
Information suitable thereto as, for example, background music. 

[0031] The audio infomnatlon provision system 100 Includes a database 130 storing a plurality of audio Information 
so streams, an Inherent condition input section 101 for receiving an Inherent condition which is Inherent to a commercial 
or public space, a variable condition input section 1 02 for receiving a variable condition which is variable in accordance 
with time, a selection section 120 for selecting at least one audio information stream from the plurality of audio infor- 
mation streams based at least on the inherent condition and the variable condition, an output section 1 40 for outputting 
the at least one audio information stream selected by the selection section 120, and a reserved condition input section 
55 103 for receiving a reserved condition which indicates that a preset audio Infonnation stream is output by the output 
section 140 at a preset time. 

[0032] The audio infonnation provision system 100 can be implemented in various forms, and elements of the audio 
information provision system 100 can be connected to each other in various fomns. For example, each element of the 
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audio information provision system 100 can be Implemented by hardware, software, or a combination of hardware and 

software. 

[0033] The selection section 1 20 can be connected to the inherent condition input section 1 01 , the variable condition 
input section 102, the reserved condition input section 103, and the output section 140 through a communication line 
(as shown in Figure 2). 

[0034] Figure 2 shows an exemplary Infomiation delivery system provided by the audio infonnation provision system 

100 using a communication line 153 according to the present invention. 

[0035] The information delivery system includes, for example, a temninal 151 used by a user in a commercial or 
public space 150 and a background music delivery center 154 of an infonnation service organization providing infor- 
mation to the terminal 151. In the commercial or public space 150, the audio infonnation is recorded and reproduced 
by the temriinal 151 and provided to the commercial or public space 150 through a reproduction device 152 as back- 
ground music. The database 130 and the selection section 120 are in the background music delivery center 154. The 
background music delivery center 154 manages a huge amount of audio information (for example, audio contents) 
stored in the database 130. The background music delivery center 154 and the terminal 151 transmit information to 
each other through a communication line 153. The communication line 153 can be, for example, a network, a wireless 
communication line or a wired communication line (for example, the Internet, a satellite communication line or a tele- 
phone line). 

[0036] The inherent condition input section 1 01 , the variable condition Input section 1 02, the reserved condition input 
section 103 and the output section 140 can be in the terminal 151. 

20 [0037] The terminal 151 can be, for example, a personal computer or a dedicated terminal device. 

[0038] In the case where the terminal 1 51 is a personal computer the user can input an inherent condition, a variable 
condition and a reserved condition to the terminal 151 using an input section such as a keyboard, a mouse, a touch 
pad or the like connected (wired or wireless) to the personal computer, while viewing a display 1 55. The user can also 
receive audio information from the reproduction device 152 connected to the personal computer 

25 [0039] In the case where the terminal 151 is a dedicated terminal device, each of the conditions can be input using 
the display 155 or the like incorporated therein. 

[0040] An "inherent condition" refers to a condition which is inherent to a target to which the audio information is 
provided. An inherent condition is, for example, an image based on the product concept, building, location or type of 
customers of a commercial space such as a store. 
30 [0041] Figures 5A. 5B and 5C each show an exemplary input interface used as the inherent condition input section 

101 (Figure 1) used by the user to input an inherent condition to the terminal 151. Input interfaces 111, 112 and 113 
respectively shown in Figures 5A, 5B and 5C are displayed on. for example, the screen of the display 155 of the 
terminal 1 51 . 

[0042] The input interfaces 111,112 and 113 are preferably user-friendly input interfaces which represent the image 

35 of the commercial or public space 150 with words or colors. 

[0043] The words representing the image of the commercial or public space 1 50 shown in the input interfaces 111, 
112 and 113 are "impression representing words" , which is unique to an audio infonnation provision system according 
to the present invention. The impression representing words can be selected using a mathematical technique such as 
factor analysis or principal component analysis from a plurality of words used by the music selecting specialists or 

40 store designers. 

[0044] The input interface 111 (Figure 5A) is designed to allow the user to input an image with an input bar including 
the impression representing words. The input interface 112 (Figure 5B) is designed to allow the user to input an image 
by specifying one point in a two-dimensional coordinate system including two perpendicular factors. The Input Interface 
112 can have a color map including colors instead of the impression representing words. 
45 [0045] The input interface 113 (Figure 5C) allows the user to input a specific name of a commercial or public space 
as an inherent condition. This dispenses the user from the need to consider the image of the store for providing the 
store with audio information suitable thereto. This also allows audio information which is certainly suitable for the store 
which It Is to be provided. 

[0046] A "variable condition" refers to a condition which varies in accordance with time. The variable condition can 
vary from moment to moment and is, for example, the season, date, time, weather, temperature, humidity, or crowd- 
edness. 

[0047] Figure 6 shows an exemplary input interface used as the variable condition input section 102 (Figure 1) used 
by the user to input a variable condition to the temriinal 151 . An input Interface 114 shown in Figure 6 is displayed on, 
for example, the screen of the display 1 55 of the terminal 1 51 . The time and date, and the day of the week are auto- 
matically displayed by the clock function of the personal computer or the dedicated terminal device acting as the temiinal 
151. The environmental data such as the weather, temperature and humidity can be automatically delivered by the 
communication function of the tenninal 151 . Thus, the user is dispensed from the trouble of inputting such data. 
[0048] A "reserved condition" refers to that a preset audio information stream is output by the output section 140 
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(Figure 1) at a preset time. When a reserved condition is input to the terminal 151, the output section 140 outputs the 
following predetermined audio information streams to the shoppers at the store at a predetermined lime: for example, 
the lime signal, a sound effect for promoting sales, a notice of time-limited discount or a notice of store opening or 
closing. The contents of the reserved condition (the type of audio information streams or time to output each audio 

5 information stream) can be set differently store to store. 

[0049] Figure 7 shows an exemplary input interface used as the reserved condition input section 103 (Figure 1) used 
by the user to input a reserved condition to the terminal 151. An input interface 115 shown in Figure 7 is displayed on, 
for example, the screen of the display 155 of the terminal 151. The input interface 115 allows the user to set the time 
to provide the store with the time signal or notice of store opening or store closing. The input interface 115 also allows 

^0 the user io:set the time to provide the sound effect for promoting sales or the notice of time-limited discount. The type 
of sound effect or the type of notice can be set by inputting a filename. 

[0050] The database 130 stores in advance a huge amount of audio information streams. Figure 8 shows the form 
of tho audio Information streams being stored. The database 130 stores audio Information streams MU(1) through ML) 
(n) and related information streams INFO(1) through INFO(n) respectively related to the audio Information streams 

'5 MU(1) through MU(n). Related Information stream INFO(n) includes, for example, a physical feature, an impression 
space coordinate value, bibliographic information or the like regarding each audio Information stream (for example, a 
song). A "physical feature" refers to the tempo, pitch, loudness, envelope or the like Inherent to a song. A physical 
feature is obtained by general acoustic signal processing such as, for example, frequency spectrum analysis, WIgner 
analysis, autocorrelation analysis or the like of the song. An "impression space coordinate value** refers to a value in 

20 a cooidinate system which represents an impression space determined in advance using the impression representing 
words inherent to the target space such as a commercial space. The coordinate system can define the impression of 
the audio information streams. "Bibliographical information" refers to the title, performer, formation of instruments, 
musical genre, time of composition of the song or the like. 

[0051] In this specification, the temi "song" is defined to refer to a tune with or without lyrics. 

25 [0052] Hereinafter, a method for obtaining an impression space coordinate value of an audio infonnation stream will 
be described. Figure 9 shows a process for creating an impression space for commercial spaces. First, a plurality of 
words representing the impression of commercial spaces are extracted. From the plurality of words, representative 
pairs of impression representing words used by unspecified people with high frequency such as, for example, "quiet - 
lively" and "luxurious - frugal" are selected. Each of these representative pairs of impression representing words is 

30 determined as a representative measure. Then, using an impression space including coordinate axes, each for one 
representative measure, a plurality of evaluators perfomri subjective evaluation of the images of the commercial spaces 
using a psychological evaluation technique such as the SD (Semantic Differential) method. The evaluation result is 
processed with a mathematical technique such as, for example, factor analysis or principal component analysis. Thus, 
from the above-mentioned impression space, another impression space having two coordinate axes perpendicular to 

35 each other is defined. Figure 13 shows an exemplary coordinate system which represents an impression space for 
the commercial spaces. 

[0053] Figure 10 shows a process for determining a mapping function for mapping songs, which are audio information 
streams, to the above-described coordinate system which represents the Impression space for the commercial spaces. 
A plurality of songs which are used as samples are processed with subject evaluation using the same measures and 

40 evaluators as used for the subject evaluation for the images of the commercial spaces. Then, from the analysis result 
and evaluation result obtained when the Impression space for the commercial spaces is defined, an impression space 
coordinate value of each of the plurality of songs used as samples in the coordinate system representing the Impression 
space for the commercial spaces is calculated. In parallel, physical features of the plurality of songs are calculated. 
From the relationship between the calculated coordinate values in the impression space and the calculated physical 
features, a mapping function for mapping a song to the coordinate system which represents the impression space for 
the commercial spaces is determined using a mathematical technique such as the multiple regression analysis or the 
like. Several ten to about a hundred songs can be used as samples for obtaining the mapping function. 
[0054] Figure 11 shows a process for mapping an enormous number of unknown songs to the coordinate system 
which represents the impression space for the commercial spaces. Once a physical feature of an unknown song is 

50 calculated, an impression space coordinate value of the unknown song is automatically calculated based on the map- 
ping function determined by the above-described process. Using the calculated impression space coordinate value, 
the unknown song can be mapped to the coordinate system which represents the impression space for the commercial 
spaces. The song of which the impression space coordinate value has been determined is stored in the database 130 
as an audio infonnation stream. The calculated physical feature and the calculated impression space coordinate value 

55 are also stored in the database 130 as related information streams. 

[0055] With reference to Figure 3, the selection section 120 also shown in Figure 1 will be described. The selection 
section 120 includes an inherent condition coordinate value calculation section 121, a variable condition coordinate 
value calculation section 122, a bibliographical information evaluation value calculation section 123. a total evaluation 
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value calculation section 124, and an audio information play list creation section 125. 

[0056] The inherent condition coordinate value calculation section 121 analyzes the inherent condition of the com- 
mercial space which has been input to the inherent condition input section 101 and detemiines the impression space 
coordinate value suitable to the inherent condition. The variable condition coordinate value calculation section 122 
analyzes the variable condition of the commercial space which has been input to the variable condition Input section 
102 and determines the impression space coordinate value suitable to the variable condition. The bibliographical in- 
formation evaluation value calculation section 123 outputs adjustment information to the total evaluation value calcu- 
lation section 124. The adjustment infonnation adjusts the probability at which an audio infomiation stream relating to 
at least either one of the inherent condition which has been input to the inherent condition input section 101 and the 
variable condition which has been input to the variable condition input section 102 is selected by the selection section 
120. 

[0057] The total evaluation value calculation section 124 analyzes the impression space coordinate value suitable 
to the inherent condition, the impression space coordinate value suitable to the variable condition, and the adjustment 
infomnation, and selects an audio infomiation stream from the database 130. The audio information play list creation 
?5 section 125 analyzes the audio infomiation stream selected by the total evaluation value calculation section 124 and 
the reserved condition which has been input to the reserved condition input section 103, and determines the order by 
which the plurality of audio information streams are to be output by the output section 140. Hereinafter, the operation 
of the selection section 120 will be described in more detail. 

[0058] Figure 12 shows a table representing results of subjective evaluation of the image of a plurality of commercial 
spaces. The table is used by the inherent condition coordinate value calculation section 121. In Figure 12, the space 
ID is used to identify a type of stores having a common representative image provided by the inherent condition input 
to the inherent condition input section 101. The subjective evaluation results are shown, for each space ID, by com- 
ponent score coefficients obtained by the SO method. For example, high quality store A represented by space ID 1 
obtained the following score coefficients for the following images: (quiet, active, luxurious, moderate price, traditional, 
casual, modern, bright) = (0.6, -0.5, 0.4. -0.6, 0.9, -0.4, -0.4, 0.2). Figure 14 shows an example of the impression space 
coordinate values in a two-dimensional coordinate system which represents the impression space for the commercial 
spaces. The impression space coordinate values are obtained based on the above-described results. Figure 13 shows 
two-dimensional coordinate system 160 representing the impression space for the commercial spaces. In the coordi- 
nate system 160, the impression space coordinate values of space IDs 1 through 7 are mapped. The center of each 
30 circle is the position represented by the impression space coordinate value of each of the spaces ID 1 through 7. Figure 
1 3 indicates that the images of the commercial spaces such as department stores and other stores can be mapped to 
the coordinate system 160. 

[0059] The process shown in Figure 3 can be summarized as follows. When the user inputs the image of a commercial 
space using, for example, the input interface 111 shown in Figure 5A, the component score coefficients shown in Figure 
12 are determined. Then, the impression space coordinate value is determined by the result of the principal component 
analysis. When the input interface 112 shown in Figure 5B is used, an impression space coordinate value is directly 
input. When the input interface 113 shown in Figure 5C is used, the name of a specific commercial space input and 
an impression space coordinate value shown in Figure 14 is determined based on the name. The impression space 
coordinate value is the inherent condition coordinate value which is calculated by the inherent condition coordinate 
40 value calculation section 121. Here, the inherent condition coordinate value is represented as "A". 

[0060] Figure 15 shows a table representing classification of a variable condition component of a commercial space 
regarding the time and the day of the week which has been input to the variable condition input section 102. The table 
is used by the variable condition coordinate value calculation section 122. In a commercial space, the type of customers 
and crowdedness are varied in accordance with the time and the day of the week. The time is classified as, for example, 
AM and PM, and the day of the week is classified as, for example, weekdays and weekend. In Figure 15. each clas- 
sification tag is provided with an ID (in parenthesis). For example, the time-and-day-of-the-week ID for Tuesday, 3 p. 
m. is WD2. Figure 16 shows a result of nine time-and-day-of-the-week IDs obtained by combining the classification 
tags in Figure 15 being mapped to the coordinate system 160, It is not necessary to map all of the time-and-day-of- 
the-week IDs to the coordinate system 160. 

[0061] Figure 17 shows a table representing classification of a variable condition component of a commercial space 
regarding the season which has been input to the variable condition input section 1 02. The table is used by the variable 
condition coordinate value calculation section 122. In a commercial space, the taste and actions of customers are 
varied in accordance with the season. The season is classified as, for example, spring, summer, etc., or by month. In 
Figure 17, each classification tag is provided with an ID. For example, the season ID for November 4 is AU2. Since 
some events occur in a particular season, special season IDs are also provided. For example, a period from April 1 to 
April 10 when schools hold entrance ceremonies is provided with the special season ID of SP2A. Figure 18 shows a 
result of the season IDs in Figure 17 being mapped to the coordinate system 160. It is not necessary to map all of the 
season IDs and all the special season IDs to the coordinate system 160. 
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[0062] Figure 19 shows a table representing classification of a variable condition component of a commercial space 
regarding the weather conditions which has been Input to the variable condition input section 102. The table is used 
by the variable condition coordinate value calculation section 122. In a commercial space, the taste and actions of 
customers are varied in accordance with the weather conditions. The weather conditions are classified by the elements 
5 such as. for example, weather, temperature, humidity, discomfort index, etc. In Figure 19, each classification tag is 
provided with an ID. For example, when the weather is fine, the temperature is 15*^0 and the humidity is 30%, the 
discomfort index is comfortable, and thus the weather condition ID is F300. The discomfort index Di Is obtained by 
expression (1) from the temperature T (*C) and the relative humidity U (%) as follows: 

10 

Di = 0.81T + 0.01U(0.99T - 14.3) + 46.3 (1). 



[0063] In this example, the discomfort index Di is classified into three stages of: comfortable, slightly uncomfortable 
and uncomfortable. Figure 20 shows a result of weather condition IDs obtained by combining the classification tags in 
Figure 19 being mapped to the coordinate system 160. It is not necessary to map all of the weather condition IDs to 
the coordinate system 160. 

[0064] The process perfomned by the variable condition coordinate value calculation section 122 shown in Figure 3 

will be further described. 

[0065] The user inputs a variable condition using the input interface 114 or the like shown in Figure 6 (alternatively, 
the time and date can be automatically set by a built-in function of the terminal 151, and the temperature and humidity 
can be delivered from the background music delivery center or the like as weather condition information which changes 
from moment to moment). Then, the time-and-day-of-the-week ID (Figure 15), the season ID or the special season ID 
(Figure 17), and the weather condition ID (Figure 19) are determined. Thus, the impression space coordinate value C 
of the time-and-day-of-the>week ID, the impression space coordinate value D of the season ID (or the special season 
ID), and the impression space coordinate value E of the weather condition ID are determined. The variable condition 
coordinate value B is determined by expression (2) as follows: 



B = cC + dD + eE (2). 

30 

[0066] The additions (+) performed in expression (2) indicate the following: when the positions represented by the 
coordinate :values C, D and E are in the same quadrant of the coordinate system, the variable condition coordinate 
value B is icalculated so as to be at the center of the three positions; and when the positions represented by the 
coordinate values C, D and E are in different quadrants of the coordinate system, the variable condition coordinate 
35 value B is calculated by performing vector calculation of the coordinate values C, D and E. Weighting coefficients c, d 
and e are determined in accordance with a prescribed rule. Which of the coordinate values C, D and E is to be the 
main element to calculate the variable condition coordinate value B can be adjusted by giving different values to the 
weighting coefficients c, d and e. 

[0067] The total evaluation value calculation section 124 shown in Figure 3 calculates a total evaluation value M, 
40 which is the final coordinate value of the target commercial space, using expression (3) based on the inherent condition 
coordinate value A and the variable condition coordinate value B. 



M = aA + bB (3) 

45 

[0068] Like in expression (2), the addition (+) performed in expression (3) indicates the following: when the positions 
represented by the coordinate values A and B are in the same quadrant of the coordinate system, the total evaluation 
value M is calculated so as to be at the center between the two positions; and when the positions represented by the 
coordinate values A and B are in different quadrants of the coordinate system, the total evaluation value M is calculated 

50 by perfonning vector calculation of the coordinate values A and B. In expression (3), a and b are weighting coefficients. 
[0069] The coordinate value represented by the total evaluation value M is the coordinate value of the target com- 
mercial space. Regarding the coordinate system 160, the total evaluation value calculation section 124 selects an 
audio infonnation stream corresponding to a related Information stream having coordinate values within a prescribed 
range (for example, one) from the coordinate value represented by the total evaluation value M. 

55 [0070] The adjustment infomnation which Is output by the bibliography infomnation evaluation value calculation sec- 
tion 123 is stored in advance in the related information stream INFO(n) shown in Figure 8. The adjustment information 
is created in advance using a meta data creation tool for the audio infomnation provision system according to the 
present invention, with reference to the bibliographical infomnation coresponding to the audio information stream. The 
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Adjustment information is determined for each space ID, each time-and-day-of-the-week ID, each season ID and each 
weather condition ID. For example, the adjustment information shows the following values: when the corresponding 
audio information is "never selected" as the background music for the target commercial space, 0 when the corre- 
sponding audio information is "absolutely selected". 1/2 when the corresponding audio information is suitable, and 2 
5 when the corresponding audio information is not very suitable. The adjustment infomnation acts as a "filter for preventing 
deviation from social commonsense" so that songs such as "Chanson de Tadieu" are never used in wedding reception 
houses. 

[0071] Figure 21 shows an example of adjustment information for each musical genre for each space ID. Figure 22 
shows an example of adjustment information for each special space ID. Figure 23 shows an example of adjustment 
10 infomnation for each season ID. 

[0072] The process performed by the bibliographical information evaluation value calculation section 123 shown in 
Figure 3 will be further described. 

[0073] The user inputs an inherent condition and a variable 

condition using the input interfaces 111,112,113 and 114 shown in Figures 5 A, 5B, 5C and 6. Then , the bibliographical 
15 intormation evaluation value calculation section 123 inquires to the database 130 about the adjustment information 
included in the related information stream INFO(n) shown in Figure 8 based on the space ID, the season ID, the time- 
and-day-of-the-week ID, the weather condition ID, and then retrieves an audio information stream related to the ad- 
justment infomnation stream corresponding to one of at least the inherent condition and the variable condition. The 
bibliographical information evaluation value H, which is the final adjustment information, is determined by expression 
20 (4) based on adjustment information F determined based on the bibliographical information related to the inherent 
condition of the audio information stream and adjustment infomnation G determined based on the bibliographical in- 
formation related to the variable condition of the audio information stream. 

2s H = fF + gG (4) 

[0074] In expression (4). f and g are weighting coefficients. The bibliographical information evaluation value H is 
output to the total evaluation value calculation section 1 24. The total evaluation value calculation section 1 24 multiplies 
the distance between the coordinate value assigned to the above-mentioned related audio information stream and the 

30 coordinate value represented by the total evaluation value M, with the bibliographical information evaluation value H, 
so as to adjust the distance. The distance is adjusted by changing the coordinate value assigned to the related audio 
information stream. When the bibliographical information evaluation value H is 0, the distance is 0 and therefore the 
related audio information stream is necessarily selected by the total evaluation value calculation section 124. When 
the bibliographical information evaluation value H is the distance is -i-oo and therefore the related audio information 

35 stream is never selected by the total evaluation value calculation section 124. 

[0075] The audio information play list creation section 125 shown in Figure 3 analyzes the audio information stream 
selected by the total evaluation value calculation section 124 and the reserved condition, and determines the order in 
which the audio information streams are output from the output section 140. The order can be determined in various 
manners in accordance with the objective. For example, the songs can be reproduced by the method of matching the 

40 tempo and loudness of the songs to the fluctuate curve of 1/f (f: frequency), to the human fatigue curve, or human 
biorhythm of the parasympathetic nervous system and the sympathetic nervous system. The audio information play 
list can be created in units of one hour or in units of one day. 

[0076] The output section 1 40 can output the audio information in accordance with the play list output from the audio 
information play list creation section 125 shown in Figure 3. 

45 

(Example 2) 

[0077] Figure 4 shows an audio information provision system 200 according to a second example of the present 
invention. In Figure 4, identical elements as those described above with reference to Figure 1 bear identical reference 

50 numerals therewith and detailed descriptions thereof will be omitted. 

[0078] The audio information provision system 200 includes an economic condition input section 104 in addition to 
the elements of the audio information provision system 100 shown in Figure 1. An economic condition input to the 
economic condition input section 104 represents a desired cost, for example, a budget of the audio information stream 
to be provided to the target. The economic condition which is input to the economic condition input section 1 04 is output 

55 to the audio information play list creation section 125 as shown in Figure 24. 

[0079] The audio information play list creation section 125 further selects audio information streams from the audio 
information streams selected by the total evaluation value calculation section 1 24 so that the cost is within the economic 
condition. From the further selected audio information streams and the audio information streams set based on the 
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reserved condition, an audio Information play list within the economic condition is created. For example, when an upper 
limit of 5000 yen is provided by the economic condition on the audio information play list created in units of one day, 
an audio information stream is created including upper level songs corresponding to the budget in the audio information 
play list selected by the total evaluation value calculation section 124 so that the total cost is within 5000 yen. 

5 

(Example 3) 

[0080] Figure 25 shows an audio information provision system 300 according to a third example of the present in- 
vention. In the third example, the audio information provision system 300 provides an individual (or user) with audio 

10 Information. The audio information provision system 300 includes an inherent condition input section 101 for receiving 
individual inherent information which is inherent to an individual as described further below, a variable condition input 
section 102 for receiving mood information which represents the mood of the individual, and an economic condition 
input section 104 for receiving a desired service cost desired by the individual. The audio infomnation provision system 
300 further includes a mood information analysis section 31 for analyzing the mood information which is output from 

»5 the variable condition input section 102 and then outputting the analysis result, an individual information accumulation 
section 32 for accumulating the individual inherent information output from the inherent condition input section 101, 
the analysis result output from the mood Information analysis section 32 and the desired service cost output from the 
economic condition input section 104, a database 130 for storing audio information streams, a selection section 120 
for performing a search in the database 130 based on the individual inherent information, the analysis result, and the 

20 economic condition and thus providing an audio infoiTnation stream which is suitable to the mood and the budget of 
the user, and an output section 140 for outputting the audio Infomriation stream selected by the selection section 120. 
The audio Information for individuals is mainly music. 

[0081] The individual information accumulation section 32 can be connected to the inherent condition input section 
1 01 , the mood information analysis section 31 , and the economic condition input section 1 04 through a communication 
25 line. The selection section 120 can be connected to the individual information accumulation section 32 also through a 
communication line. The output section 140 can be connected to the selection section 120 through a communication 
line. Each communication line can be an arbitrary network, such as, for example, the Internet. 

[0082] In the example shown in Figure 25, the inherent condition input section 101, the variable condition input 
section 102, the economic condition input section 104, the mood infonnation analysis section 31 , and the output section 

30 140 are included in the temriinal 151 (Figure 2). 

[0083] The individual information accumulation section 32 can be set in a control center having an accounting 
processing jf unction for counting the cost of the audio information streams provided to each user 
[0084] In. the example shown in Figure 25, the database 130 and the selection section 120 can be included in a 
content holder. The content holder manages a huge amount of audio information streams (contents) stored in the 

35 database 130. 

[0085] "Individual inherent information" refers to data which is inherent to the user. Examples of the individual inherent 
infonnation include the name, sex. date of birth, occupation, birthplace, family structure, musical experience, favorite 
music, and credit card number of the user. 

[0086] "Mood information" refers to data which represents the feeling of the user. Examples of the mood information 
^0 Include (i) data which represents the state of the user himself/herself such as the feeling, emotion and psychological 
condition of the user, and (ii) data which represents the nature of music such as the mood, image and genre of the 
music that the user wants to listen to at a certain time. When the user does not know which genre of music that he/ 
she wants to listen, it is not necessary to input the genre. It is preferable, though, to input the genre of the music that 
he/she wants to listen, in order to obtain music which is closer to the mood of the user. 
45 [0087] A "desired service cost' refers to the cost that the user is ready to pay in exchange of the audio Infonnation 
provision service . The user can input any amount of money as the desired service cost in consideration of their budget. 
The user can determine the desired service cost in accordance with the duration, number of songs or quality of the 
music provided. Alternatively, the user can determine the desired service cost in consideration of the effect provided 
by the music in accordance with the suitability of the music to his/her mood. Still alternatively, the user can determine 
so the desired service cost in consideration of the production cost of the music that the user assumes. 

[0088] The input interface used by the user to input the mood information is preferably a user-friendly input interface 
which represents the image of the music desired by the user with words or colors. 

[O089] The input interface can be a check box as shown in Figure 26, In addition to the input interfaces 111 (input 
bar), 112 (map) and 113 shown in Figures 5A, 5B and 5C. Figure 26 shows an input interface 116 which includes 
55 check boxes and allows the userto input detailed setting for individuals. The Input interface 1 16 also allows the individual 
inherent information and the desired service cost to be input to the terminal. The input interface 116 is displayed on, 
for example, the screen of the display 155 of the terminal 151 (Figure 2). 

[0090] The input Interface 116 including check boxes allows the user to input the mood infonnation by selecting the 
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words which represent the image of the music he/she desires. Such words are, for example, words representing the 
feelings such as "calm" or "cheerful and happy", words representing a location such as "southern" or "seaside", or 
words representing a color such as "red" or "blue". 

[0091] Using the Input Interface 116 shown in Figure 26, the user inputs the individual Inherent information (for 
5 example, text data representing the name of the user) and clicks the check box of the desired type of music in the 
musical genre check list (for example, the check box of "bossa nova"). The user also clicks the check box or boxes of 
the words representing the image of the desired music (for example, the check boxes of "calm", "cheerful and happy" 
and "southern"), and inputs the desired service cost. In this manner, the individual Inherent information is Input to the 
inherent condition input section 101 , the mood information is input to the variable condition input section 102, and the 
10 desired service cost is input to the economic condition input section 104. In this example, the musical genres that the 
user likes and does not like can be registered in advance as individual inherent information. 

[0092] The individual inherent information and the desired service cost which have been input are accumulated in 
the Individual Information accumulation section 32 together with the credit card number or the like. The mood Information 
which has been Input is analyzed by the mood Information analysis section 31. The analysis result is represented as 

15 values weighted by different coefficients for a plurality of different musical representation factors. 

[0093] Figure 27 shows an example of mood information input by the user and the analysis result provided by the 
mood information analysis section 31 . In the example shown in Figure 27, the mood information is data which represents 
the Image of the music desired by the user. A "musical representation factor" refers to a factor used to represent the 
nature of the music (for example, a cheerfulness factor or a sadness factor). The nature of the music is represented 

20 by values f(1), f(2), f(m) (where m is a total number of musical representation factors) weighted by different coeffi- 
cients for a plurality of different musical representation factors. 

[0094] In the following description, "values weighted by different coefficients for a plurality of different musical rep- 
resentation factors" will be referred to also as an "analysis result using musical representation factors". 
[0095] The mood information is transformed into an analysis result using musical representation factors in compliance 
25 with a mood representation rule. The mood representation rule is defined, in advance, by a table which transforms an 
image of music into values of musical representation factors by a psychological technique such as the SD method or 
the multi-dimensional scaling. 

[0096] The mood infonnatlon analysis section 31 outputs the analysis result using musical representation factors to 
the individual information accumulation section 32. 

30 [0097] The individual information accumulation section 32 accumulates the individual inherent information, the anal- 
ysis result of the mood information and the desired service cost as described above, and also sends information rep- 
resenting a selection condition (i.e., the desired service cost and the analysis result using musical representation 
factors) to the selection section 120. The analysis result of the mood infonnation and the desired service cost are 
accumulated in the individual information accumulation section 32 as at least a part of individual preference data which 

35 represents the taste of the user. The individual preference data is updated each time the analysis result of the mood 
infonnation and the desired service cost are input. 

[0098] The selection section 120 performs a search in the database 130 based on the desired service cost and the 
analysis result using musical representation factors. 

[0099] The structure of the audio infomnation streams stored in the database 1 30 is similar to that shown in Figure 8. 

40 [0100] Referring to Figure 8, the related information streams INFO{1) through INFO(n) each include bibliographical 
information (title of the song, perfonner, musical genre, formation of instruments, or the like), the result of analysis of 
the audio information streams using musical representation factors (i.e., values g(1), g(2), g(m) (where m is a total 
number of musical representation factors) weighted by different coefficients for a plurality of different musical repre- 
sentation factors), and a basic provision cost. 

^5 [0101] The "basic provision cost" refers to a basic cost which is calculated based on copyright managing cost, pro- 
duction cost and the like. 

[0102] The analysis results using musical representation factors included in the related information streams of the 
database 130 are obtained by analyzing the audio Information streams in a method similar to the method used by the 
information analysis section 31. 

50 [0103] The selection section 120 calculates a sum S of the absolute values of the differences between the analysis 
results using musical representation factors provided by the mood Information analysis section 31 (I.e., the values f 
(1 ), f(2). .... f (m) weighted by different coefficients for a plurality of different musical representation factors) and analysis 
results using musical representation factors included in the related infomnation streams of the database 130 (i.e., the 
values g(1), g(2), g(m) weighted by different coefficients for a plurality of different musical representation factors) 

55 in accordance with expression (5). The above-mentioned sum S will be referred to as a "difference S", hereinafter. 

S = Zlf(l)-g(i)l(i=1.2. ....m) (5) 
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[01 04] The selection section 1 20 outputs audio information streams corresponding to the related information streams, 
as the selection result. The audio Information streams are output In the order starting from the audio information stream 
corresponding to the smallest difference S. As the selection result, a single audio information stream can be output, 
or a plurality of audio Information streams can be output. The number of audio information streams which are output 
as the selection result is determined in a manner described below. 

[01 05] The selection section 1 20 adds an adaptation cost to the basic provision cost of each audio Infomriation stream. 
The "adaptation cost" Is obtained by multiplying the basic provision cost by an adaptation ratio R. The adaptation ratio 
R increases as the difference S is smaller (i.e., the accuracy of the selection result with respect to the audio information 
stream demanded by the user is higher). It should be noted that the upper limit of the adaptation ratio R is specified. 
Alternatively, the upper limit of the adaptation ratio R can be automatically determined based on the number of audio 
infomnation:streams provided as the selection result, the basic provision cost, and the desired service cost within a 
range of, for example, ±10% (the margin can be freely determined by the music providing side, for example, the content 
holder). - 

[01 06] The number of audio information streams wh ich are output as the selection result is determined in accordance 
with the desired service cost. Audio infomriation streams are output until the grand total of the total costs exceeds the 
desired service cost. The total cost of each audio information stream is the sum of the basic provision cost and the 
adaptation cost. In this manner, at least one audio Infomriation stream Is output as the selection result. Even an identical 
song may cost differently to different individuals when the adaptation ratio is different. 

[01 07] Table 1 show an exemplary selection result provided by the selection section 1 20. In this example, the desired 
seivice cost Is 500 yen, and the upper limit of the adaptation ratio R Is 25%. 



Table 1 





Music file No. 


Basic provision cost (yen) 


Difference S 


Adaptation ratio R (%) 


Provision cost (yen) 


1. 


#00011 


100 


0.11 


25 


125 


2. 


#03770 


100 


0.19 


20 


120 


3. 


#00462 


200 


0.25 


15 


230 


4. 


#09944 


150 


0.26 


10 


165 



25 



30 



35 



40 



45 



50 



55 



[0108] The total cost of three songs counted from the song corresponding to the smallest difference S is 475 yen, 
which is less than the desired service cost of 500 yen. The total cost of four songs counted from the smallest difference 
S is 640 yen, which exceeds the desired service cost of 500 yen. Therefore, the selection section 120 outputs the 
upper three songs (i.e., music file numbers #00011, #03770 and #00462). 

[0109] Due to such a system, according to the audio Information provision system 300, even an identical audio 
infonnation streams is purchased at different costs by each individual. 

[01 1 0] The audio information stream output from the selection section 1 20 is provided to the user through the output 
section 140. 

[01 1 1 ] It is preferable to adopt a system of allowing the user to listen to the audio Information stream for a prescribed 
time period (for example, 45 seconds) free of charge so that the user feeds back to the audio Information provision 
system whether the user is satisfied with the provided audio infonnation stream. 

[0112] Figure 28 shows an audio information provision system 400 having such a feedback function. In Figure 28, 
identical elements as those described above with reference to Figure 25 bear identical reference numerals therewith 
and detailed descriptions thereof will be omitted. 

[0113] The audio Infomriation provision system 400 includes a satisfaction degree information input section 105 and 
a feedback section 36 in addition to the elements shown in Figure 25. 

[0114] The satisfaction degree information input section 105 is structured so that the user can input information 
indicating whether the user is satisfied with the provided audio Information stream. 

[0115] More specifically, the user can sample the provided audio Infomriation stream and then input satisfaction 
degree information, which Indicates whether the user is satisfied with the provided audio information stream, to the 
satisfaction degree information input section 105. When the user inputs information indicating that "he/she is satisfied 
with the provided audio information stream" to the satisfaction degree information input section 105, such information 
is provided to the individual information accumulation section 32. 

[0116] It Is preferable that the Individual Information accumulation section 32 notifies the accounting section to bill 
the user only when it has received the information indicating that "the user is satisfied with the provided audio information 
stream". Thus, the user is not billed until the user is satisfied. 

[01 1 7] When the user inputs information indicating that "the user is not satisfied with the provided audio information 
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stream" to the satisfaction degree information input section 105, such information is provided to the individual infor- 
mation accumulation section 32. In this case, it is preferable that the user also inputs the image he/she has on the 
audio information stream that he/she is not satisfied with, to the satisfaction degree information input section 105. Thus, 
the satisfaction degree of the user (or how much the provided audio information stream matches the mood of the user 

5 and the budget) can be fed back to the audio information provision system 400. 

[01 18] Figure 29 shows an exemplary input interface 117 used to input, to the satisfaction degree information input 
section 105, (i) whether the provided audio information was sampled or not, (ii) whether the user was satisfied with the 
provided audio infonmation stream, and (iii) the image the user has on the audio information stream that he/she was 
not satisfied with. The input interface 117 is displayed on, for example, the screen of a display of the terminal (not 

10 shown). 

[0119] Using the input interface shown in Figure 29, the user inputs the image he/she has on the audio information 
stream that he/she was not satisfied with, to the satisfaction degree information input section 105. In the example 
shown in Figure 29, the images of the music are represented by the same words as those used in the input interface 
116 shown in Figure 26. These images of the music are transformed into values weighted by different coefficients for 
'5 a plurality of different musical representation factors (analysis result using musical representation factors). 

[0120] The analysis result using musical representation factors which has been input to the satisfaction degree in- 
formation input section 105 is sent to the individual infomnation accumulation section 32. 

[0121] The individual information accumulation section 32 updates the analysts result using musical representation 
factors and also outputs the updated selection condition to the selection section 120. By updating the analysis result 
20 using musical representation factors accumulated in the individual information accumulation section 32. the precision 
of the analysis result using musical representation factors improves as the same user continues to use the audio 
information provision system 400 overtime. As a result, the individual adaptability to that user is improved. 
[0122] The selection section 120 perfomns another search in the database 130 based on the updated selection 
condition. 

25 [0123] In this manner, the satisfaction degree of the user (or how much the provided audio infomriation stream match- 
es the mood of the user and the budget) can be fed back to the audio information provision system 400. 
[0124] Returning to Figure 28, the individual information accumulation section 32 accumulates the past selection 
results in addition to the individual inherent infonnation, mood infomnation. and the desired sen^ice cost. 
[0125] The feedback section 38 refers to the past selection results accumulated in the individual information accu- 

30 mulation section 32 as individual preference data and notifies the individual preference data to the variable condition 
input section 102. 

[0126] The variable condition input section 102 includes a plurality of input interfaces. The variable condition input 
section 102 is designed to provide the user with an input interface corresponding to the individual preference data 
notified by the feedback section 36 among the plurality of input interfaces. 
35 [0127] Figure 30 shows an exemplary input interface 118 used to input mood infomnation to the variable condition 
input section 102. The input interface 118 show in Figure 30 is displayed on, for example, the screen of a display of 
the terminal (not shown). 

[0128] In the example shown in Figure 30, it is assumed that an audio information stream in the musical genre of 
bossa nova was provided to the user in an immediately previous selection. The genre of the audio information stream 
40 -provided to the user in the immediately previous selection can be found by referring to the past selection results ac- 
cumulated in the individual information accumulation section 32. 

[0129] The feedback section 36 refers to the past selection results accumulated in the individual information accu- 
mulation section 32, and controls the input interface in the variable condition input section 102 based on the past 
selection results. As a result, as shown in Figure 30, check boxes for Latin music genres similar to bossa nova obtained 

^5 in the immediately previous selection (for example, Afro-Cuban, salsa, rumba or the like) are added. 

[0130] The variable condition input section 102 can have an input interface usable to input information representing 
musical elements (for example, rhythm, key, tempo, beat and the like). When the user has knowledge of music, the 
user can input mood information using the input interface representing the musical elements. Thus, mood information 
having a higher adaptability can be input. 

50 [0131] Figure 31 shows an audio information provision system 500. In Figure 31, identical elements as those de- 
scribed above with reference to Figure 25 bear identical reference numerals therewith and detailed descriptions thereof 
will be omitted. 

[0132] The audio information provision system 500 includes an audio information processing section 37 in addition 
to the elements shown in Figure 25. 
55 [0133] The audio information processing section 37 transfomis information representing musical elements (for ex- 
ample, rhythm, key, tempo, beat and the like) into a file fomiat on the database 130 and sends the transformed infor- 
mation to the selection section 120. The selection section 120 selects and outputs audio information streams as de- 
scribed above. 
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(Example 4) 

[0134] Figure 32 shows an audio information provision system 600 according to a fourth example of the present 
invention. In Figure 32, identical elements as those described above with reference to Figure 25 bear identical reference 
numerals therewith and detailed descriptions thereof will be omitted. 

[0135] When the user inputs individual inherent information, mood information and a desired service cost in the hope 
of obtaining specialized musicotherapy, the input data is sent to the individual infonnation accumulation section 32 In 
the control center. The control center accumulates the input data in the individual information accumulation section 32 
and, when necessary, sends the input data to an individual information accumulation section 39 in a musicotherapy 
association to which musicotherapists are registered. The data sent the musicotherapy association is accumulated in 
the individual information accumulation section 39. The individual information accumulation section 39 can be con- 
nected to the individual infomnation accumulation section 32 through an arbitrary type of communication line. 
[0136] In this case, the variable condition input section 102 provides the user with an input interface which is similar 
to a medical examination questionnaire in which the user is to describe his/her physical and mental states. The eco- 
nomic condition input section 104 provides the user with an input interface which allows the user to select a time period 
and a cost of one session. 

[0137] Figure 33 shows an input interface 119 provided by the variable condition input section 102 and the economic 
condition input section 104 when the user demands musicotherapy. 

[01 38] A musicolherapisl analyzes the data accumulated in the individual information accumulation section 39 based 
on expert knowledge and inputs the analysis result (for example, data which indicates what type of music is suitable) 
to a music infomnation processing section 38. The music infomnation processing section 38 is included in, for example, 
the terminal 151 (Figure 2). The musicotherapist generally has knowledge that, for example, "the first movement of 
Mozart's Symfonia Concertante is effective to an insomniac". Therefore, the musicotherapist inputs instruction infor- 
mation that "look for Mozart's Symfonia Concertante and songs similar thereto" to the music information processing 
^5 section 38 in order to provide the insomniac with a suitable audio information stream. 

[0139] The music information processing section 38 performs acoustic signal analysis such as frequency spectrum 
analysis, Wigner analysis, autocorrelation analysis or the like of the designated song, and thus extracts musical physical 
features such as the tempo, pitch, loudness, envelope, sound features and the like. Then, the music information 
processing section 38 sends these musical physical features as an instruction information processing result to the 
selection section 120. The selection section 120 can be connected to the music infonnation processing section 38 
through an arbitrary type of communication line. 
[0140] Based on the instruction information processing result, the selection section 120 performs a search in the 
database 130. The selection section 120 selects and outputs audio information streams as described above. 
[0141] Such a service can select and provide audio information streams which are suitable to various states and 
35 various types of mood of the user at a cost desired by the user. 

[0142] In the information communication society of today, an enormous number of people have physical and mental 
stress. The audio information provision system 600 in the fourth example can select and provide music which is suitable 
to each feeling or each physical and mental state so as to encourage and heal these people. Especially, songs for 
musicotherapy have conventionally been selected based on knowledge from psychiatric counselors and therapists. 
According to the audio information provision system 600 of the present invention, a great number of songs suitable to 
the physical and mental states of patients can be easily selected and provided in a short time period. 

(Example 5) 



30 
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45 
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55 



[0143] Figure 34 shows a coordinate system 700 which is a music image chart in a fifth example of the present 
invention. The coordinate system 700 is provided to the user as an input interface provided by the variable condition 
input section 102 of the above-described examples. The coordinate system 700 is displayed on, for example, the 
screen of the display 155 (Figure 2). The coordinate system 700 represents an impression space used for locating 
images of each individual audio infonnation stream belonging to pop and popular music genres on a two-dimensional 
coordinate system. Coordinate axes 701 are representative factor axes for the pop and popular music genres. The 
vertical axis is a still-active axis, and the horizontal axis is a rhythm-melody axis. Reference numeral 702 represents 
image parts which indicate how images of the pop and popular music are classified. When an individual specifies at 
least one point in the coordinate system 700, impression information representing the impression of an audio informa- 
tion stream is input to the variable condition input section 102. 

[0144] Each image part includes the following adjectives (A) through (I) which are used by the sampler of an audio 
information stream for representing images of the music. 

(A) Adjectives representing calmness: calm, mild, carefree, ingenuous, soft 
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(B) Adjectives representing degree of sentimentality: romantic, sentimental, deep, dramatic 

(C) Adjectives representing naturalness: natural, stable, neutral, monotonous, simple 

(D) Adjectives representing light-footedness: 

5 light-footed, refreshing, clear-cut 

(E) Adjectives representing curiousness: 
mysterious, unique, curious 

10 

(F) Adjectives representing dynamicalness: 



vigorous, high-spirited, dynamic, vital, active, pop 

^5 (G) Adjectives representing tenseness: sharp, tense, exciting, cool, tight 

(H) Adjectives representing intensiveness: 



violent, sweltering, powerful, energetic, wild, noisy, lively, boisterous, electric, mechanical, dashing 



20 (I) Adjectives representing sophistication: 

danceable, urban, stylish, sophisticated 



[0145] The images of each audio information stream are associated with musical features such as the tempo, fre- 
25 quency characteristics, formation of instruments, pitch, fluctuation of tempo, and the like. Therefore, all the genres of 
pop and popular music are mapped on the music image chart by the classification by the musical features and the 
classification of the Image. 

[0146] The coordinate system Is created and mapping is performed in basically the same process as that described 
in the first example. Hereinafter, a process for creating the coordinate system and the performing of mapping will be 

30 described. 

[0147] The adjectives representing images of music, the image parts and the representative factor axis are deter- 
mined using the foltowing psychological techniques. First, a psychological technique referred to as the sound source 
description selection method Is used. The sound source description selection method selects representative adjectives, 
representing audio Information streams, from language data which unspecified people associate to Images perceived 

35 When actually sampling the audio information streams. Consideration of frequency of use of the adjectives and semantic 
associations of the adjectives with the images Is included In the sound source description selection method. Then, a 
psychological technique such as the SD method or the like is used to perform multiple-stage evaluation of the Images 
of the audio infomnation streams. As the multiple-stage evaluation, five-stage or seven-stage evaluation is typically 
used. From the result of the multiple-stage evaluation, psychological feature of each audio information stream is ob- 

40 talned. Representative factors are determined by factor analysis such as principal component analysis or the like. 
Representative factors are selected from the representative adjectives so that the total of the evaluated suitability 
degree of each adjective is equal to or greater than 75%. When two factors amount to less than 75%, three factors are 
used as representative factors. On the two-dimensional plan having the representative factor axes as X and Y axes, 
the psychological feature is mapped. Thus, the image chart is created. Figure 35 shows classification of adjectives 

45 which are described highly frequently regarding pop and popular music. 

[0148] Figure 36 shows the relationship between the musical features of audio Information streams and image parts. 
In the case of audio information stream 1 , the tempo is 140, the amplitude is large in a low frequency range and a high 
frequency range, the average pitch is 2500 Hz, and the fluctuation of the tempo is l/f^. In this case, the image part is 
Hh. In the case of audio information stream 2, the tempo is 70, the amplitude gradually decreases as the frequency 

50 increases, the average pitch Is 800 Hz, and the fluctuation of tempo is 1/f. In this case, the Image part is Bb. Such 
classification of the image part can be made by an expert familiar with music or by automatic processing. Once all the 
genres of pop and popular music are mapped on the music image chart, the music image chart can be used as an 
input interface for searching for audio information streams providing a certain Image orfor selecting songs. Thus, songs 
having a particular image associated to the user can be more likely and Intuitively retrieved or selected. 

55 [0149] The music image chart can be used In order to present search results or song selection results. 

[01 50] Figure 37 shows a coordinate system 71 0 which is an exemplary modification of the coordinate system 700. 
In the coordinate system 710 shown in Figure 37, colors associated with words are used for the image parts. 
^ [0151] Figure 38 shows a coordinate system 720 which is an exemplary modification of the coordinate system 710 
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shown in Figure 37. In the coordinate system 720 shown in Figure 38, the shapes and colors of coordinate points 
representing audio infornnatlon streams mapped on the Image parts can be altered in accordance with the Image. For 
example, the coordinate point representing the audio information stream on an image part 721 representing the degree 
of sentimentality can be a heart-shaped symbol and pink. The coordinate point representing the audio information 
stream on an image part 722 representing the tenseness can be a triangular symbol and black. Alternatively, the size 
of a coordinate point can be changed in accordance with the strength of the image. On the same image part, acoordinate 
point representing an audio information stream giving a stronger image can be larger (see, for example, an image part 
723). Thus, the user can intuitively associate an audio infonnation stream with an image. 

[0152] Figure 39 shows a coordinate system 730 which is another exemplary modification of the coordinate system 
700 shown in Figure 34. The coordinate system 730 shows an exemplary image chart created for classical music. The 
shapes and sizes of the image parts and the adjectives representing the images of the music are changed to be suitable 
to classical music from those used in the coordinate system 700 for pop and popular music. By using the adjectives 
representing images of the music which are more suitable to the type of audio information streams desired by each 
individual, an image of the audio information streams desired by the individual is more easily obtained. 
[01 53] According to the present invention, an audio information stream is selected from a plurality of audio information 
streams based on the inherent condition of the target to which an audio information stream is to be provided and the 
vanable condition which changes in accordance with time. Thus, an audio infonnation stream fulfilling both of the 
Inherent condition and the variable condition can be provided. 

[0154] According lo the present invention, music which is suitable to the objective, image, change in accordance 
with time and change in accordance with environment of commercial spaces, public spaces and private spaces where 
people can enjoy themselves can be selected by a simple method in a short time period. Thus, the cost which is 
conventionally quite high by reliance on specialists can be reduced, and music suitable to each listener can be provided 
from a wide variety of selections. 

[0155] According to the present invention, use of a music image chart as an Input interface provided by the variable 
condition input section allows the user to intuitively retrieve and select audio information streams which are best suited 
to his/her mood. 

[0156] Various other modifications will be apparent to and can be readily made by those skilled in the art without 
departing from the scope and spirit of this invention. Accordingly, it is not intended that the scope of the claims appended 
hereto be limited to the description as set forth herein, but rather that the claims be broadly construed. 



Claims 

1 . An audio information provision system for providing a target with an audio infonnation stream suitable to the target, 
comprising: 



a database for storing a plurality of audio information streams; 

an inherent condition input section for receiving an inherent condition of the target; 

a variable condition input section for receiving a variable condition varying in accordance with time; 

a selection section for selecting at least one audio Information stream from the plurality of audio Information 

streams based on at least the inherent condition and the variable condition; and 

an output section for outputting the at least one audio information stream. 

2. An audio information provision system according to claim 1 , wherein: 

45 

the database stores a plurality of related information streams respectively related to the plurality of audio 
Information streams, 

each of the related information streams has a coordinate value representing a position of the corresponding 
audio information stream In a prescribed coordinate system which defines an impression of audio information 
50 streams, 

the selection section determines a coordinate value representing a position of the target in the prescribed 
coordinate system based on the inherent condition and the variable condition, and selects at least one audio 
information stream corresponding to at least one related infonnation stream having a coordinate value which 
is located within a prescribed range from the coordinate value representing the position of the target. 

55 

3. An audio information provision system according to claim 2, wherein: 

at least one related infonnation stream among the plurality of related information streams includes at least 
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one adjustment infonrnatlon stream which indicates that a distance between a coordinate value included in the 
at least one related information stream and the coordinate value of the target is adjusted, based on at least 
one of the inherent condition and the variable condition, and 

the selection section changes the coordinate value included in the at least one related infonnation stream. 

5 

4. An audio information provision system according to claim 1 , further comprising a reserved condition input section 
for receiving a reserved condition indicating that a preset audio information stream is output by the output section 
at a preset lime, wherein the output section outputs the preset audio information stream at the preset time. 

10 5. An audio information provision system according to claim 1 , further comprising an economic condition input section 
for receiving an economic condition representing a desired cost for the at least one audio information stream, 
wherein the selection section selects at least one audio information stream, based on the economic condition, 
from among the at least one audio information stream selected from the plurality of audio information streams 
based on the inherent condition and the variable condition. 

15 

6. An audio information provision system according to claim 2, wherein the plurality of related information streams 
further include a plurality of physical feature infomnation streams each representing a physical feature of the cor- 
responding audio information stream of the plurality of audio infonnation streams and a plurality of bibliographical 
information streams each representing a bibliography of the corresponding audio information stream of the plurality 

20 of audio information streams. 

7. An audio information provision system according to claim 1 , wherein the selection section is connected to the 
inherent condition input section, the variable condition input section and the output section via a communication 
line. 



25 



30 



8. An audio information provision system according to claim 1 . wherein the target is one of a commercial space and 
a public space. 

9. An audio information provision system according to claim 1 . wherein: 



the target is an individual, 

the inherent condition represents inherent information of the individual, and 
the variable condition represents mood information of the individual. 

35 10. An audio information provision system according to claim 9, further comprising: 

an economic condition input section for receiving an economic condition representing a desired cost for the 
at least one audio information stream; 

a mood information analysis section for analyzing the mood infonnation and outputting a mood information 
^0 analysis result; and 

an individual information accumulation section for accumulating the inherent infonnation, the mood information 
analysis result and the economic condition, 

wherein the selection section selects at least one audio information stream, based on the economic condition, 
^5 from among the at least one audio information stream selected from the plurality of audio information streams 

based on the Inherent information and the mood infonnation analysis result. 

11. An audio information provision system according to claim 10, wherein: 

^0 the mood information analysis result and the economic condition are accumulated in the individual information 

accumulation section as individual preference information representing an individual preference, and 
the individual preference information is updated each time the mood infonnation analysis result and the eco- 
nomic condition are input to the Individual infomnation accumulation section. 

55 12. An audio information provision system according to claim 9, further comprising a satisfaction degree information 
input section for receiving satisfaction degree information representing a satisfaction degree of the individual for 
the at least one audio information stream. 
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13. An audio information provision system according to claim 10, wherein: 



the individual information accumulation section accumulates a past selection result provided by the selection 

section. 

5 the audio information provision system further comprises a feedback section for presenting to variable condi- 

tion input section, as individual preference information representing an individual preference, the past selection 
result accumulated in the individual information accumulation section, and 

the variable condition input section provides the individual with an input interface based on the individual 
preference information. 

10 

14- An audio information provision system according to claim 9, further comprising: 

an. economic condition input section for receiving an economic condition representing a desired cost for the 
at least one audio information stream; 

a mood information analysis section for analyzing the mood information and outputting a mood Information 
analysis result; and 

an individual information accumulation section foraccumulating the inherent Information, the mood information 
analysis result and the economic condition, 



wherein the selection section selects at least one audio information stream from the plurality of audio infor- 
mation streams based on Instruction information from a musicotheraplst based on the inherent Information, the 
mood infomiation analysis result and the economic condition. 

15. An audio infomnation provision system according to claim 9, wherein the variable condition input section inputs 
impression information representing an impression of an audio information stream desired by the Individual as the 
mood information. 



16. An audio information provision system according to claim 15, wherein: 

the variable condition input section Includes a display section, 

the variable condition input section provides the individual with a prescribed coordinate system which defines 
an impression of audio information streams through the display section, and 

the impression infomnation is input to the variable condition input section by the individual specifying at least 
one point in the prescribed coordinate system, 

35 

17. An audio information provision system according to claim 16, wherein: 

the prescribed coordinate system includes a plurality of words representing the impression, and 
the plurality of words are changed in accordance with the type of audio information stream desired by the 
"^o individual. 



18. An audio information provision system according to claim 16, wherein the prescribed coordinate system has a 
plurality of image parts. 

19. An audio Information provision system according to claim 16, wherein the Impression Is represented by at least 
one of a word, a color and a symbol. 
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FIG. 7 
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Season ID 


Related bibliographical information 
' 


Adjustment 
information 


SU2 


Singers associated with summer 




bands associated with summer 


0 


Summer, sea, sun 




Hot, tignting spirit, tropics 




SPiA 


Girls Festival 


0 


SUiA 


Carp streamer 


0 


AUiA 


Athletics meet, physical training 


0 


ml, WI2 


Singers associated with winter 


1/2 


ll/Ti A 

WliA 


Christmas 


0 


WI2A 


An nnnp^p mi i^ir^ 


1/2 


Early spring, new year, beginning of the year 


0 


WIIB 


"Auld Lang Syne" 


0 


WIIB 


"Symphony No. 9" by Beethoven 


0 


SP1B 


Songs ossociated with spring 


1/2 


W01. WEI 


Morning, waking 


1/2 


HD1 


Night, midnight 


+ 00 


WD2, WD3 


Morning .waking 


+ 00 


WE2, WE3 


• • » 




HD2, HD3 


• • • 
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The following songs will satisfy your demands within your budget 
of 500 yen. ^ 



Title 

1. Song M 

2. Song Mn 

3. Song MM 



Performer 

Singer S 
Player Ss 
Singer SS 



Duration 

5'30" 
4'30" 
5'00" 



Do you want to sample them? 

0 Yes 
□ No 

If you do not like some of the songs after sampling them, 
they will be removed from the list. We will provide different 
songs after searching again. 



1. 

2- 



Title 

Song M 
Song Mn 



Satisfied 



s 



Not satisfied 
0 



3. Song MM 

Tell us how strongly you feel Song M fits the following images. 



Calm 

Bright and happy 
Southern 



Very strong . 



Medium 
1 



Veiy weak 

+V — 



+ 



+ 



+ 



+ 



+ 
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We provided you with the following bossa nova songs the lost time. 
Title Performer Duration 
1- Song M Singer S 5'30" 

2. Song Mn Player Ss 4'30" 

3. Song MM Singer SS 5'00" 

Do you wont songs other than the above? 

0 Yes 
□ No 

Latin genres similar to bossa nova you desired the last time 
are added to your list 



Musical genre 
check list 

□ Bossa nova 

□ Classic 

□ Jazz 

□ Pop (Western) 
O Pop (Japanese) 
Q New age 

□ Pop (Asian) 

□ Screen music 



Musical genre check 
list added 

□ Afro-Cuban 

□ Son 

O Habanera 

□ Samba 
O Rumba 

□ Salsa 
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FIQ. 35 



1 


Powerful, violent, noisy, dashing, wild 
Calm. mild, carefree 


2 


Vigorous, pop, dynamic, high-spirited, vital, active 
Non-viqorous. deep, sentimental, romnntir 


3 


Refreshinq. liaht-footed. clear-out 
Swelterinq. boisteroust lively 


4 


Natural, simple, stable, neutral, monotonous 
Dramatic, urban, sophisticated, stylish 


5 


Sharp, cool, tight, tense, exciting 
Soft 
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Audio information 
stream 1 


Audio information 
stream 2 


Tempo 


140 


70 


Frequency 
characteristics 










Formation of 
instruments 


8 

Electric base, 
2 electric guitars, 
drums, percussion, 
synthesizer, vocal, 
trumpet 


2 

Piano, tenor sax 


Average pitch 


2500 


800 


Fluctuation of 
tempo 


1/f2 


1/f 


Image part 


Hh 


Bb 
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